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Abstract 

We propose new space- time codes tailored to point-to-multipoint, or broadcast, communications using layered 
source coding. Our codes can be encoded (and decoded) differentially, and they are based entirely on phase-shift 
keying. We develop analytical design criteria for the codes, and we discuss the design of optimal and suboptimal 
receiver structures. We also discuss the relation between our new codes and a differentially encoded Alamouti code. 
Numerical examples illustrate the performance of our new codes. 

Index Terms 

space-time coding, diversity, maximum-likelihood, differential encoding, joint estimation/detection, nonuniform 
constellations, phase-shift keying 

I. Introduction 

During the last decade, wireless systems with multiple transmit and receive antennas have been studied 
extensively, and the performance of such systems has been proven to be extremely promising. The propa- 
gation channel associated with a system with multiple transmit and receive antennas is sometimes called a 
multiple-input multiple-output (MIMO) channel, and the associated coding and signal processing is referred 
to as space-time coding. Using proper space-time coding, it is possible to use the degrees of freedom of the 
MIMO channel both to increase the throughput and to counteract fading. For this reason, MIMO technology 
is believed to become a major cornerstone in many future wireless communication systems. 

Loosely speaking, communication links can be classified into two categories: point-to-point links and 
point-to-multipoint links. In the former case, there is exactly one transmitter and one receiver which com- 
municate with each other at a given time, whereas for a point-to-multipoint link, a transmitted message is 
aimed at multiple different recipients simultaneously. A cellular communication system with mobile users 
is an example of a point-to-point communication system, whereas a radio/TV broadcast is an example of 
a point-to-multipoint link (therefore point-to-multipoint links are sometimes referred to as broadcast chan- 
nels). 

Point-to-multipoint links are becoming increasingly important. For instance, the introduction of Digital 
Audio Broadcast (DAB) and High-Definition Television (HDTV) has pioneered a whole new field of digital 
broadcasting applications. As a further example, it is widely believed that much of the next generation's 
wireless networking will be based on so-called ad-hoc networks, where it may be necessary for multiple 
units to listen to one message at the same time. For some early and fundamental research on broadcast 
channels the reader is referred to [1] and [2]. 

There are two major differences between point-to-point and point-to-multipoint communication links. 
First, a point-to-point connection can be optimized for a given transmitter-receiver pair. For instance, a 
cellular system usually employs power control techniques that adjust the transmitted power to minimize the 
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power consumption, reduce the amount of co-channel interference, and at the same time ensure that the 
received signal strength exceeds a certain threshold. Second, in contrast to a point-to-point communication 
link, for a broadcast transmission all receivers have different capabilities to decode the transmitted message. 
This is so because the different receivers experience in general very different radio link qualities (for instance, 
due to varying propagation conditions). Moreover, since quality can usually be traded for cost, the receivers 
themselves may have different inherent abilities to decode the transmitted message. 

The fact that distinct receivers have different capabilities of decoding a message suggests that the trans- 
mitted signal should consist of several components which are of different importance for the reconstruction 
of the message (and therefore have an inherently different vulnerability to transmission errors). This idea 
has lead to the concept of layered source coding (see, e.g., [3], [4], [5], [6]), which is now a mature technique 
employed in many multimedia standards. For instance, the image coding standard JPEG-2000 and the video 
coding standard MPEG-4 use what is sometimes referred to as "fine granularity scalability," which enables a 
gradual tradeoff between the error-free data throughput and the quality of the reconstructed image or video 
sequence [7]. Such progressive source coding methods are already in use in many Internet applications 
where data rate can be traded for quality, and they are expected to play an instrumental role for the next 
generation of wireless standards to provide ubiquitous access both to the Internet, and to diverse sources of 
streaming video and audio. 

The topic of this paper is space-time coding for broadcast channels when layered source coding is used. 
Previous works on the topic include [8], which discussed "beamforming" techniques (hence requiring feed- 
back of channel state information) for a multiuser system. Reference [9] proposed a layered space-time 
coding scheme assuming different receivers may have different numbers of receive antennas, and that a 
receiver with more receive antennas can decode more layers of messages. In this paper, we suggest a new 
layered space-time coding scheme that does not require any channel knowledge at the transmitter, and which 
is constructed starting from a criterion that attempts to minimize the error rate. Our new codes also satisfy 
two other important design goals. First, they can be encoded (and detected) differentially. Second, they are 
entirely based on phase-shift keying (PSK) modulation and consequently the transmitted signal has constant 
envelope at all times. The codes proposed in this paper can therefore be seen as a multidimensional extension 
of a technique in [10]. 

II. Nonuniform SISO PSK Codes 

With layered source coding, for instance as used in JPEG-2000 and MPEG-4, more capable receivers can 
achieve a higher data rate by decoding the messages contained in all layers while the less capable receivers 
can only decode the message in the bottom layer. Thus the "most important" information should be conveyed 
via the bottom layer while "less important" information can be transmitted via higher layer(s). 

An example of a modulation scheme designed for layered source coding is nonuniform phase-shift-keying 
modulation (PSK), which was first introduced by Pursley and Shea [10] (see also [11] and [12]). The 
idea is intuitively appealing. Starting with a standard uniform PSK constellation, which contains data for 
the "basic" layer, a nonuniform PSK code constellation is obtained by adding a usually small additional 
phase shift a to each original constellation point, which contains the information for the "additional layer. 
This is illustrated in Figure 1, when the basic message is conveyed via a QPSK constellation. A capable 
receiver with high signal-to-noise-ratio (SNR) can distinguish among all eight constellation points which 
means that both the basic and the additional layer messages can be decoded, whereas to a less capable 
receiver the constellation may appear like a blurred QPSK constellation and hence it may only be able to 
distinguish between the different "large" phase shifts; thus only the basic message can be detected accurately. 
By such a construction the error rates associated with the basic and the additional message are different. 
Consequently, the additional message can carry information that is of less importance for the reconstruction 
of the transmitted message than the basic message is. The error probability of the additional message can 
be easily adjusted by choosing different values of the additional phase shift a. Obviously, a larger a results 
in a better performance for the additional message at the cost of a deterioration of the performance for the 
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basic message. The main virtue of this nonuniform PSK encoding scheme is that the signaling envelope is 
constant while multi-layered messages can be transmitted with different and adjustable error performance. 

III. Space-Time Coding 

Consider a MIMO system with n t transmit antennas, n r receive antennas, and assume for simplicity that 
the propagation channel is linear, time-invariant and frequency flat. Let H be an n r x n t matrix whose 
(m, n)th element contains the channel gain between transmit antenna n and receive antenna ra, and suppose 
that a code matrix X of dimension n t x N is taken from a matrix constellation X and transmitted by 
sending its N columns via the n t antennas during N time epochs. If the signal received during the same N 
time intervals is arranged inann r x matrix Y, then: 

Y = HX + E (1) 

where E is an n r x N matrix of noise. Throughout this paper, we shall make the somewhat standard 
assumption that the elements of H and E are independent and zero-mean complex Gaussian with variances 
p 2 and a 2 , respectively; hence the channel is Rayleigh fading. 

Given Y, detection of X in a maximum-likelihood (ML) sense amounts to minimizing the Euclidian 
distance 

\\Y-HX\\ (2) 

with respect to X 6 X, as well as H (unless it is known). The average (over H) probability that the ML 
detector mistakes a transmitted code matrix X 0 for an incorrect (and different) code matrix X ^ X 0 can 
beboundedby [13],[14], [15]: 

E[P(X 0 - X)}\^ <\(X- X 0 )(X - X 0 )T nr • (^)" nrni ( 3 ) 

and 

E[P(X 0 -+ X)) < \X 0 Tli H X»\- n ' ■ ({jY™ (4) 

noncoherent \ £ ±U / 

for the two cases that the channel H is known and unknown to the receiver respectively. Here | • | denotes 
the determinant of a matrix and (•)** stands for the complex conjugate transpose. 

Note that if H is unknown, we can obtain a detection rule that depends only on X. Consider the problem 
of minimizing (2): 

min \\Y-HXf (5) 

Treating H as a deterministic unknown, we can minimize this cost function with respect to H. The result 
is (see, e.g., [16] and [15]): 

H = YX H {XX H y l (6) 

Inserting (6) into (5) yields: 

min \\Y-YX H (XX H )- l Xf^mm \\YU^ H f (7) 

where 

U x = X(X H X)~ 1 X H 



stand for the projections onto the range space of X and its orthogonal complement, and J is the identity 
matrix. 
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IV. Differential Modulation for MIMO Systems 

Space-time modulation matrices that can be encoded differentially are often of special interest since such 
codes can easily be demodulated noncoherently. Differential codes with uniform error probabilities have 
been studied by many authors (see, e.g., [17], [18], [19], [20], [21] for some prominent examples) and some 
of them can be seen as an extension of differential PSK to MIMO systems. In general, if t is the time index 
and if {U(t)} is a sequence of (square) information-bearing matrices, then differential encoding obtains the 
transmitted code matrix X(t) at time t via: 



X(t) = X(t - l)U(t) 



(9) 



where X(t — I) is the matrix transmitted at time t — 1. Such an encoding usually is only meaningful under 
certain circumstances, for instance if U(t) is unitary (in which case X(t) becomes unitary for all t as well, 
given a unitary "initial matrix" X(0)). By considering two (or more) received matrices Y(t) and Y(t - 1) 
simultaneously, noncoherent detection is possible. For example, if we concatenate the matrices received at 
time t — 1 and t, and assume that the channel H remains constant over these two time intervals (to within 
practical accuracy), we can write (using (1) and (9)): 



pr(t-i) Y(t)] 

-- [HX(t - 1) HX(t)] + [E{t - 1) E{t)] 
*[HX(t-l) HX(t-l)U(t)] + [E(t-l) E(t)] 
--HX{t -!)•[/ U(t)] + [E(t - 1) E(t)] 



(10) 



where HX(t - 1) can be seen as an unknown "effective" channel matrix and [I U(t)] is an effective code 
matrix. Provided that [I U(t)] is such that the corresponding determinant in (4) is nonzero, noncoherent 
demodulation is possible. By applying the decision rule (7) to the data model (10), the detection rule for 
differential demodulation becomes: 



mm 

min Tr 

U(t)€X 



min Tr 

U(t)€X 



[Y(t-i) Y(t)].n{j u(t)]H 

[[Y(t-1) Y(t)].Ui 
|[Y(t-l) Y(t)] 



I 

-u H (t) 

max R e {Tv{U(t)Y H (t)Y(t-l)} } 



Y H {t - 1) 
Y H (t) 

-U(t) 



} 



Y*(t - 1)1 1 
Y»{t) J J 



(11) 



where Tr{ } stands for the trace of a matrix and where we used the fact that the information-bearing matrix 
U(t) is unitary. The associated average error probability, i.e., the probability that C/ (0) is wrongly detected 
as U for this differentiation detection scheme can be bounded by using (4): 



e [p(um - 10] < I [/ c/ (0) ] • nfi U]H 



i 



/ (? \-n r n t 



= \l-i(UU^ H + U^U H ) 
2 



o2 x — *T 



/ \-n r n t 



(12) 



For simplicity hereafter the time index t of U(t) is omitted whenever no confusion can occur. 
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V. Nonuniform Space-Time Codes 

Inspired by the nonuniform PSK codes discussed in Section II, our new nonuniform space-time codes 
are based on differential encoding of the product of a unitary code matrix U b G X b associated with a basic 
message, and another unitary code matrix U & € X % corresponding to an additional message. Thus the 
transmitted code matrix at time t is given by (cf. (9)): 

X(t) = X(t-l)U b (t)U a (t) (13) 



Using (12) along with the fact that U b U A is unitary, we find that the average probability that a transmitted 
message pair (U® ,£7° ) is mistaken for another pair (£/ b ,E/ a ) can be bounded by: 

E[P{UlU»^U h ,U>)\ 

i - \{uvjj™u™ + u*tfu?u* )|" nr . (^)" nrn< ° 4) 

A bound such as (12) or (14) was used in, for instance, [17] for the design of (uniform) differential space- 
time codes. However, although it may be thought of as a feasible approach, an attempt to minimize this 
bound in the context of nonuniform space-time modulation may not produce the desired result since the 
target error rates for U b and U a are different. 



A. Design Criteria 

Since the additional message can be decoded only at high SNR, the matrices {C/J associated with the 
additional message should approximately be close to the identity matrix. Inspired by this observation, we can 
first consider the design of {U b }, treating the presence of C/ a as an unmodelled noise-like disturbance term. 
Doing so, we can approximately bound the error probability for U b alone (assuming differential detection) 
by: " 

E [PfJJi - £U] < |/ - \{ufU; + UJU*») • (j^f)"* (15) 

where a 2 is a factor that incorporates the noise-like effect of the presence of C/ a . Although the "bound" (15) 
is somewhat heuristic (and probably neither tight nor very accurate), we believe that it may serve a purpose 
as a meaningful design criterion for {U b }. 

Next, for the design of {U a } we proceed as follows. Suppose that the SNR is in a region such that U b can 
be reliably decoded. For the design of { LTJ this should be a reasonable assumption, since if it is not true then 
decoding of t/ a is probably of less interest anyway. Assuming that U b is known, the demodulation of t/ a 
is essentially another noncoherent detection problem. To obtain a criteria for the design of the constellation 
{Z7 a }, we want to form an error bound on U a and average it over all possible basic messages {U b }. Using the 
bound (4) for noncoherent detection appears to yield criteria that are very difficult to use. As a suboptimal 
approach we used instead the bound (3) for coherent detection. Doing so results in the following criterion: 

E [P(U? -» U.)] <^ £ \{Utf - U b U.)(U b U° - U b U.)»r ■ {£s)' 

U b€*b (16) 



where in the last step U b disappears since it is unitary. In (16), | - | denotes the number of elements of the 
set. 
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B. Union Bound on the Error Probability 

By the union bound, the probability for the basic message to be in error (averaged over all possible pairs 
of additional messages) can be bounded by: 



E [P(basic message in error)] 

— y 

\x\ 2 ^ 

jfc/n 



"1*1 



(17) 



-Ttr 



Likewise, the error rate for the additional message (averaged over all possible pairs of basic messages), can 
be bounded by a similar expression: 



E [P(additional message in error)] 

<J_ y 1 - 

-m ^ 



1*1 W 



These bounds will be useful for performance optimization. 



(18) 



-Tlr 



.2 x - r 



( p* \ ~ nrnt 



VI. Design Examples 

A R b = l, R e = l Code for 2 TX Antennas 

We first construct a code where the rate for the basic message is R* = 1 bit/sec/Hz and the rate for the 
additional message is R A = 1 bit/sec/Hz as well. We take the basic message U h taken from the following 
set: 



*-{[*!]. 





'0 -l" 




" 0 f 




"-1 


0 " 


]• 


1 0 


1 


-1 0 




0 - 


-1 



} 



(19) 



The constellation in (19), which is uniform and possesses certain optimality properties, is due to Hughes 
[17] and was called "BPSK" therein. Based on the design rules in Section V-A, we have handcrafted the 
following constellation of matrices for the additional message t7 a : 









e i,r7 0 " 




'e~ iwX 0 




- e -i7T 7 0 


■{ 


0 e™ 7 


5 


0 e™ x 


5 


0 e - ™ 7 


5 


0 e~ inX 



} 



(20) 



where (A, 7) are design parameters (to be discussed below). 

The constellation may appear to be somewhat arbitrary but it possesses some nice features. It is symmetric 
and uniform which means that the error probability performance should be the same for all constellation 
points. Also similarly to the nonuniform PSK encoding for a single channel (cf. Section II), multiplication 
with the matrices in (20) can be interpreted as adding a small phase shift to the elements of the basic message 
U b . 

If we take, somewhat arbitrarily: 



X(0) = 



1 1 
-1 1 



(21) 
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it follows that 

X H (t)X(t) = 21 (22) 

for all t and hence the differential encoding of the code is meaningful Furthermore, we can verify that all 
elements of X have always unit magnitude: 

|*w(*)| = l (23) 

and hence all transmitted symbols are obtained by constant envelope modulation, which was one of our 
design goals. 

The error performance (of both the basic and the additional message) will depend on A and 7, and typically 
the error performance of the additional message can be traded against that of the basic message. The values 
(A,7) must be chosen carefully. For instance, at first glance intuition would perhaps suggest us to take A > 0 
and 7 = 0, but we can show that such a "simple" choice leads to a code that "works" but that does not 
provide maximal diversity. 

In our experiments, we used the union bound (17) and (18) to optimize over (A, 7). In particular, for a 
given "tolerable" loss in performance for the basic message, we can find the pair (A, 7) that minimizes (18). 
The result is shown in Figure 2. For example, if we can accept a degradation of 1.5 dB for the basic message, 
then A = 0.2 and 7 = 0.035 are optimal. The optimization over (A, 7) is further illustrated in Figure 3, where 
we show how the performance associated with the basic and additional messages varies with (A, 7). 

Figure 4 shows the empirical bit-error-rate (BER), obtained via Monte-Carlo simulation, for the code 
described above using A = 0.2, 7 = 0.035 and ML decoding. The solid lines (" — ") show the performance 
of differential nonuniform BPSK for a conventional system with n t = 1 transmit antenna and a single 
receive antenna (this is essentially a special case of [10]). The dashed lines ("- -") show the performance 
for a system with n t = 2 (and a single receive antenna) using the new code presented above. For the curves 
without marks, only a basic message is transmitted. The curves with marks show the performance when 
both a basic and an additional message are transmitted: the curves marked with "o" show the BER for the 
basic message, and the curves with "x" show the BER for the additional message. Clearly, the transmit 
diversity system outperforms the conventional one - observe, in particular, the different slopes of the BER 
curves both for the basic and for the additional message. The simulation also confirms that the transmission 
of an additional message incurs a small performance degradation for the basic message. 

B. R b = 2, R a = 1 Code 

To obtain a code with a higher information rate for the basic message, we next take U h from the following 
algebraic group of 16 matrices (which is due to [17] as well) generated by: 



'e jn/4 0 




'0 -1" 


0 e~ j7r / 4 




1 0 



(24) 



The notation in (24) means that all possible matrices can be generated by choosing any arbitrary integers M 
and TV in the following expression: 



V*/ 4 0 


M 


'0 -1" 


N 


0 e- jn/i 




1 0 





(25) 



The constellation used for the additional message is chosen to be the same as (20), so we now have a system 
with rate R b = 2 for the basic message and rate R a = 1 for the additional message. The corresponding 
simulated BER is shown in Figure 5. 
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VII. Suboptimal Detector for Nonuniform D-STBC 

To detect both the basic and the additional messages in a joint ML fashion we have to search through all 
l<*b| * l^al possible combinations to find the combination of U b and C/ a which maximize the cost function in 
(1 1). This may be computationally burdensome if the constellation sizes of both layers are large. A possible 
remedy to this problem is to use a suboptimal detection rule described as follows: 

Step 1. Detect the basic message only using (7), neglecting the presence of the additional message: 



0b{t) = uSSU I [Y{t ~ l) Y{t)] ' n * 



<^ m ax **{Tt{U>(t)Y"(t)Y(t-l)}} 
Step 2. Detect the additional message assuming the decision taken in Step 1 is correct: 



2 

(26) 



min 



2 



[Y(t - 1) . U b (t) Y(t)] ■ Hi ua h 

r r m » \^ (27) 

» m^^[Tv{UXt)Y H {t)Y{t - l)U b (t)}} 

Let us elaborate more on why this suboptimal detection scheme works. Since E/ a is close to the identity 
matrix, when compared to the basic message U b it can be treated as small unmodelled noise; hence the de- 
tection rule in Step 1 can still be able to detect the basic message with only a small performance degradation. 
As the error performance of the basic message is typically superior to that of the additional message, we 
can assume the detected basic message is correct when we are dealing with the detection of the additional 
message. Thus the observation model for the detection of t/ a (i) becomes: 

. [Y (t - 1) • U b (t) Y(t)] 

= ifX(i-l).[<7 b (f) U b (t)U>(t)] + [E(t-l) E(t)] (28) 
= HX(t-l)U h (t)-[l U m (t)] + [E(t-l) E(t)] 

Treating HX(t - l)U b (t) as an unknown "effective" channel as in (10) and using (7) and (11) yields the 
detection rule in Step 2. 

By applying this suboptimal detection rule the computational complexity is reduced from \X b \ • \X>\ to 
|<*b| + |^a|- For example, for the R b = 2, = 1 code, the computational time is reduced from \X h \ • \X A \ = 
16 • 4 = 64 to \X b \ + \X Z \ = 16 + 4 = 20. For this example the reduction is not very significant; yet this 
suboptimal detection scheme does provide more flexibility as opposed to the joint ML detection because 
if only the basic message is needed, the computational load will be further reduced to \X b \ = 16 (for this 
example). 

We expect that there is some performance loss associated with this suboptimal detection scheme since 
it does not exploit knowledge of the structure of the additional message in the Step i. The Monte-Carlo 
simulation result in Figure 6 shows the performance of the suboptimal detector for the R* = 2, = 1 code 
compared to the performance of the joint ML detector. The detection of the basic message gives a small 
performance loss (« 0.3 dB) compared to the joint ML detector, whereas the detection of the additional 
message is (to within practical accuracy) equal to that of the joint ML detector. 

VIII. Comparison of Differential Group Codes and A Differential Alamouti Code 

A. Differential Alamouti Code 

The main advantage of the nonuniform space-time group codes introduced in Section VI is that the signal 
envelope is constant. However, also for this reason they cannot achieve the maximum possible performance 
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(in terms of BER) because they do not make use of any amplitude information in the signal. On the con- 
trary, orthogonal space-time block codes (OSTBC) should provide better performance (BER) since they are 
already optimized by construction (see [22], [23] and [15]). For 2 TX antennas OSTBC reduces to Alamouti 
code. The construction of an Alamouti code matrix is as follows: 



*i(t) s* 2 (t) 
s 2 (t) -s{(t) 



(29) 



where si and s 2 are the information bearing symbols and (•)* stands for the complex conjugate. For the R b = 
1 system, s x and s 2 are taken from the BPSK set {—1, 1}, and for the R b = 2 system, they are taken from 
the QPSK set {— 1, — j, The l/y/2 factor is included for normalization purposes so that the transmitted 
power per time interval equals one. This Alamouti matrix U(t) can then be encoded differentially by using 
(9) in Section IV. Note that the Alamouti code works together with the additional message C/ a taken from 
(20) to form a nonuniform constellation, and differentially encoded by (13), but unfortunately the matrices 
in the so-obtained constellation do not have constant signal envelope. 

Here we compare the performance of the group codes to that of the Alamouti code. The R* = 1 group 
codes (19) in Section VI and R b = 1 Alamouti codes are indeed equivalent, so there is no performance 
advantage for Alamouti codes for this case. The equivalence of both codes is easy to show as in this case the 
set of Alamouti matrices becomes: 





-i 


l 


-i 


l" 


i 


[i -HI- 


l -l 


'V2 


i 


i 


'72. 



(30) 



If we pre-multiply the above set by 



then we obtain the R b = 1 group code in (19). 
For the R b = 2 codes, the Alamouti code does provide somewhat better BER performance than the group 
codes in (24) and (20). This is so since the Alamouti code uses both signal amplitute and phase to transmit 
information; thus it should be less susceptible to the phase disturbance caused by the additional message. 
Figure 7 shows the simulated performance difference between the R b = 2, R^ = 1 nonuniform group codes 
in (24) and (20) and a R b = 2 Alamouti code together with the additional message (20). For the basic 
message the Alamouti code outperforms the group code by about ldB, and it is somewhat more robust when 
additional message is added. Meanwhile the performances of the detection of the additional message are 
equal both when the Alamouti and the group codes (20) are used as the basic message carrier. 



B. Alamouti Code with Differentially Encoded Symbols 

It may be argued that both the constant signaling envelope and maximum BER performance can be 
achieved at the same time by using symbol-wise differential encoding and forming the Alamouti matrix 
afterwards. However, unfortunately this encoding scheme does not work. This is demonstrated as follows. 

Suppose that we employ symbol-wise differential encoding: 

*i(t) = *i(t-l)-«i(f) nn 
x 2 (*) = x 2 (t-l)-s 2 (t)_ CJU 

and use these elements to form the Alamouti matrix: 



X(t-1) = 
X(t) = 



(32) 
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The above code matrix has the property that \xi(t)\ = \x 2 {t)\ = \si(t)\ = \s2(t)\ = 1 for all t and hence the 
signal envelope is constant for both transmit antennas all the time. Also X(t) is unitary for all t since it is 
an Alamouti matrix. The observation model (10) becomes (cf. Section IV): 

[Y(t-1) Y(t)} 

mH[X(t-l) X(t)] + [E(t-l) E(t)} (33) 
=HX{t-\)[l X H (t - l)X(t)] + [E(t - 1) E(t)] 

Employing the noncoherent detection rule (1 1 to the above model (33) the following detection rule is ob- 
tained: 

.,(«>«. ^{Tr{(^-i W) r"(t)y(t-i))} } 

max 

si(t),s 2 (t)€S 

I \l«2(«-l) -*l(«-l)Jl«2(*-l) ••»(«) -xl(t-l)-Sl(t)\ Y W^l' ^JJ ^ 34 > 



max 

ai(t),s 2 (t)e5 



Re Tr *iW + «2(0 *i(*-l)*5(*-l)-(«aW«*iW) y HM W n 

Ke lr xi(*-l)s 2 (f-l)>(si(*)-s 2 (i)) sI(t) + 52(«) Y 

The above maximization problem (34) cannot be solved unless x x (t - 1) and x 2 {t - 1) are known. For 
example, consider the following two sets of values: 

{si(t), 5 2 (i), *i(t - 1)«2(* - 1)} = {1, -1, 1} (35) 

and 

{*i(t), 5 2 (t), ^(t - l)rc 2 (t - 1)} = {-1, 1, -1} (36) 
These make the metric in (34) equal to: 

Re{ T r|[° " 0 2 ]y^(t)y(t-l)}} (37) 

which implies that the sets {si, s 2 } = {1, -1} and {s u s 2 } = {-1, 1} are not identifiable if x x (t - 1) and 
x 2 {t — 1) are unknown. 



IX. A Network Application Example 

We consider employing our nonuniform space-time codes in a broadcasting telecommunication system. 
For simplicity we assume that the base station is located at the center of a circular cell. Define the outage 
radius to be the radius of the area such that a specified detection performance (in terms of BER) is achievable 
with a certain probability that we call the Quality of Service (QoS). For example, if we let Rq denote the 
99% outage radius then within the area of radius Ro the message can be detected at a specified BER during 
99% the of total time. 

If nonuniform space-time codes are used to increase the data rate for the more capable receivers, it is 
clear that the outage radius for the transmission of the basic message under the same QoS will decrease to 
R b < Rq. Meanwhile the outage radius of the transmission of the additional message now becomes R a ; of 
course its value depends on QoS for the additional message. For the simplicity of our analysis here let us 
assume the same QoS for both the basic and the additional message. 
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Denote the "tolerable" performance loss for the detection of the basic message as A dB. Assuming log- 
distance fading it is straightforward to show that: 

| = 10-** (38) 

R 2 r 2 

jjf = 10-- n (39) 

where F is the performance difference (in dB) between the detection of the additional message and that 
of the standard uniform constellation system. The parameter n is the path loss exponent for the radio link 
environment (i.e.: Power oc (distance) ~ n ; for details see [24]). The above equations are also valid in a 
log-normal fading model. 

Figure 8 shows how R b and R a vary with A for the R* = 1, R> = 1 system, with the power attenuation 
factor of the radio link taken to be n = 2 and n = 4; these are typical values for free-space and downtown 
areas, respectively [24]. It can be seen that employing the nonuniform constellation does not increase the 
total throughput of the system (i.e., the sum of the data rate for all users) in the case n = 2. For example, 
if the outage area for the basic message is allowed to decrease to 0.8 times of that of the original cell area, 
the outage area of the additional message can only reach 0.15 times of that of the original cell area, which 
results in a decrease in overall system throughput. However for the case n = 4, the use of a nonuniform 
code increases the total throughput. This observation is not hard to understand; for n — 4 the area closely 
surrounding area a base station enjoys a relatively much higher SNR than the area close to the border of the 
cell and hence the outage area for the additional can be extended further. Note that the outage areas of the 
nonuniform layered code (19) and (20) (with R b = R a = 1) tend to approach that of the R b = 2 code (24) 
when A increases. 



X. Concluding Remarks 

We have presented new nonuniform space-time codes that can be encoded and detected differentially, and 
that are based entirely on phase-shift keying. We also discussed analytical criteria for code construction and 
optimization, and we compared its performance with that of a scheme based on the Alamouti code. We also 
studied a suboptimal detector and its performance, which we found to be satisfactory (a loss within 0.3 dB 
compared to joint ML decoding). We also demonstrated how nonuniform space-time codes in a broadcasting 
telecommunication system can increase the total throughput. 

It may be argued that using already established nonuniform constellations for single transmit antenna sys- 
tems together with, for instance, known linear space-time codes should be a natural approach to the problem 
of designing nonuniform space-time constellations. However, there are at least two problems associated 
with such an approach. First, it may in general not be optimal, simply because we are optimizing over the 
class of space-time codes and the class of nonuniform single-antenna constellations separately, instead of 
optimizing over the class of nonuniform MIMO constellations. Second, we found it difficult to incorporate 
constraints (such as constant envelope after differential encoding), that are desired or required from a practi- 
cal implementation point of view. Therefore, we believe that it may be advantageous to design nonuniform 
space-time constellations by approaching the problem from first principles. 
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Fig. 1 . A nonuniform 8-PSK constellation obtained from a standard "uniform" QPSK constellation by splitting each original constellation 
point **o" into a pair of new points "x". 
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Fig. 2, Optimal values of (A, 7) for the = 1, = 1 code, for a given acceptable performance degradation of the basic message, 
results are obtained via minimization of the union bound (17) and a corresponding expression for the error rate of the additional message. 
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Fig. 3. Performance degradation for the basic message and performance gain for the additional message for the R\> = 1, R a = 1 code. The 
results are obtained via (1 7), along with a corresponding expression for the additional message. The curves for the basic message are normalized 
relative to the "undisturbed case" (A = 7 = 0) and the curves for the additional message are normalized relative to X = 0.2, 7 = 0.035. 
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Fig. 4. Empirical BER for the R b = 1, fl a = 1 code with A = 0.2. 7 = 0.035. 
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Fig. 6. Monte-Carlo Simulation of the BER performance for the suboptimal receiver (26) and (27) in Section VII, for the joint ML receiver and 
for the basic message only system with R b = 2, R a = 1 code. The solid and dash curves with "x" overlap. 
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Fig. 7. Monte Carlo Simulation result for the BER performance of Rt> = 2, B n = 1 system, with differential group codes and differential 
Alamouti codes. Note that the solid and dash curves with "x" marks overlap. 
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Fig. 8. Outage area for the basic/additional messages as a function of the "tolerable" performance loss, for the Ft* = 
horizontal curves represent the relative outage area for the R b = 2 code with a basic message only. 
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We propose new space-time codes tailored to point- to-multipoint, or 
broadcast, communications using layered source coding. Our codes can be 
encoded (and decoded) differentially, and they are based entirely on phase-shift 
keying. We develop analytical design criteria for the codes, and we discuss 
the design of optimal and suboptimal receiver structures. We also discuss the 
relation between our new codes and a differentially encoded Alamouti code. 
Numerical examples illustrate the performance of our new codes. Convolutional 
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improve the performance of the convolutional codes. For completeness, we also 
include some fundamental concepts of the space-time coding system and the 
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CHAPTER 1 
INTRODUCTION 

During the last decade, wireless systems with multiple transmit and receive 
antennas have been studied extensively, and the performance of such systems 
has been proven to be extremely promising. The propagation channel associated 
with a system with multiple transmit and receive antennas is sometimes called a 
multiple-input multiple- output (MIMO) channel, and the associated coding and 
signal processing is referred to as space-time coding. Using proper space-time 
coding, it is possible to use the degrees of freedom of the MIMO channel both 
to increase the throughput and to counteract fading. For this reason, MIMO 
technology is believed to become a major cornerstone in many future wireless 
communication systems. 

Loosely speaking, communication links can be classified into two categories: 
point-to-point links and point-to-multipoint links. In the former case, there is 
exactly one transmitter and one receiver which communicate with each other at 
a given time, whereas for a point-to-multipoint link, a transmitted message is 
aimed at multiple different recipients simultaneously. A cellular communication 
system with mobile users is an example of a point-to-point communication 




Figure 1-1: A point-to-point link. 
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Figure 1-2: A point-to-multipoint link. 



system, whereas a radio/TV broadcast is an example of a point-to-multipoint 
link (therefore point-to-multipoint links are sometimes referred to as broadcast 
channels). 

Point-to-multipoint links are becoming increasingly important. For in- 
stance, the introduction of Digital Audio Broadcast (DAB) and High-Definition 
Television (HDTV) has pioneered a whole new field of digital broadcasting 
applications. As a further example, it is widely believed that much of the next 
generation's wireless networking will be based on so-called ad-hoc networks, 
where it may be necessary for multiple units to listen to one message at the 
same time. Also, in conventional cell communication system using directional or 
adaptive antennas, it is sometimes necessary to broadcast a message to the entire 
cell. 

There are two major differences between point-to-point and point-to- 
multipoint communication links. First, a point-to-point connection can be 
optimized for a given transmitter-receiver pair. For instance, a cellular system 
usually employs power control techniques that adjust the transmitted power 
to minimize the power consumption, reduce the amount of co-channel interfer- 
ence, and at the same time ensure that the received signal strength exceeds a 
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certain threshold. Second, in contrast to a point-to-point communication link, 
for a broadcast transmission all receivers have different capabilities to decode the 
transmitted message. This is so because the different receivers experience in gen- 
eral very different radio link qualities (for instance, due to varying propagation 
conditions). Moreover, since quality can usually be traded for cost, the receivers 
themselves may have different inherent abilities to decode the transmitted 
message. 

The fact that distinct receivers have different capabilities of decoding a mes- 
sage suggests that the transmitted signal should consist of several components 
which are of different importance for the reconstruction of the message (and 
therefore have an inherently different vulnerability to transmission errors). This 
idea has lead to the concept of layered source coding (e.g., [1, 2, 3, 4]), which is 
now a mature technique employed in many multimedia standards. For instance, 
the image coding standard JPEG-2000 and the video coding standard MPEG-4 
use what is sometimes referred to as "fine granularity scalability," which enables 
a gradual tradeoff between the error-free data throughput and the quality of 
the reconstructed image or video sequence [5]. Such progressive source coding 
methods are already in use in many Internet applications where data rate can 
be traded for quality, and they are expected to play an instrumental role for the 
next generation of wireless standards to provide ubiquitous access both to the 
Internet, and to diverse sources of streaming video and audio. 

The topic of this thesis is space-time coding for broadcast channels when 
layered source coding is used. Previous works on the topic include Memarzadeh 
et al. [6], which discussed "beamforming" techniques (hence requiring feedback of 
channel state information) for a multiuser system. In Memarzadeh et al. [6], and 
two kinds of beamforming techniques are discussed. Zero-forcing beamforming is 
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introduced which orthogonal codewords are used for each user so that inter-user 
interference can be fully eliminated just as in an ideal CDMA system, provided 
that exact channel state information is available at the transmitter. Another way 
is the single user optimal beamformer, for which no attempt is made to avoid 
inter-user interference. This is suitable for the scenario when the channel state 
information at the transmitter is not very accurate, as interference suppression is 
not effective. 

Kuo et al. [7] proposed a layered space- time coding scheme assuming 
different receivers may have different numbers of receive antennas, and that a 
receiver with more receive antennas can decode more layers of messages. 

In this thesis, we suggest a new layered space-time coding scheme that does 
not require any channel knowledge at the transmitter, and which is constructed 
starting from a criterion that attempts to minimize the error rate. Our new 
codes also satisfy two other important design goals. First, they can be encoded 
(and detected) differentially. Second, they are entirely based on phase-shift 
keying (PSK) modulation and consequently the transmitted signal has constant 
envelope at all times. The codes proposed in this paper can therefore be seen 
as a multidimensional extension of a transmission technique for single-antenna 
systems in Pursley et al. [8]. 



CHAPTER 2 
BROADCAST CHANNELS 

Broadcast channels are defined as the simultaneous communications between 
a single source and several receivers. 

Early works include Cover et al. [9] and Bergmans et al. [10]. In Cover 
et al. [9], They discuss the nature of broadcast channel from the perspective 
of information theory, and derives upper bounds of the capacity of broadcast 
channels of different types. It is proved that the maximum data rate of each 
channels are jointly achievable by proper channel coding. By superimposing side 
information to the common message intended for all receivers, different data rates 
are possible for different channels. This idea is similar to the nonuniform codes 
which we will discuss later on. 

Due to the different capabilities of each receivers, and indeed the capabilities 
of each receivers are varying from time to time (i.e., due to channel fading or 
fluctuating noise level), the theoretical maximum capacity of a broadcast channel 
is not easy to achieve in a practical system. 

We have no intention to compare the performance of our nonuniform codes 
with the theoretical channel capacity in this thesis, but as our nonuniform codes 
contain data in multiple layers, we believe that we are in the right direction that 
our nonuniform codes can increase the total throughput of a broadcast system. 
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CHAPTER 3 
LAYERED SOURCE CODING 

In this chapter we briefly introduce the concept of layered source coding. 
JPEG-2000 is taken as the example here. 

JPEG is an image compression standard developed by the Joint Photo- 
graphic Experts Group [11]. For the encoding, the pixel bit map information of 
a whole image is divided into 8x8 pixel blocks. Each 8x8 pixel block then 
undergoes a discrete cosine transformation (DCT). This helps separate the image 
into parts (or spectral sub-bands) of different importance (with respect to the 
image's visual quality). The DCT is similar to the discrete Fourier transform as 
it transforms a signal or image from the spatial domain to the frequency domain. 
For most images, much of the signal energy lies at low frequencies, whereas the 
higher frequency components are often small and usually they can be neglected 
with little visible distortion. Thus the DCT indeed separates the original image 
data into separate layers of different importance: the low frequency components 
are important for the reconstruction of the image; the higher frequency terms 
are not that important although they enhance the visual quality of the image. 
From this observation, we infer that the low frequency parts of data must be sent 
through a reliable data channel with very good bit-error-rate (BER) performance 
whereas the high frequency parts can be sent through a less reliable channel 
as the occurrence of bit errors will only deteriorate the quality to a negligible 
extent. 
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With layered source coding (for instance as mentioned above, JPEG-2000 
and MPEG-4), more capable receivers, for example with higher signal-to-noise- 
ratio (SNR), can achieve a higher data rate by decoding the messages contained 
in all layers while the less capable receivers can only decode the message in the 
bottom layer. Thus the "most important" information should be conveyed via 
the bottom layer while "less important" information can be transmitted via 
higher layer (s). 

An example of a modulation scheme designed for layered source coding is 
nonuniform phase-shift- keying modulation (PSK), which was first introduced 
by Pursley and Shea [8] (see also Pursley et al. [12, 13]). The idea is intuitively 
appealing. Starting with a standard uniform PSK constellation, which contains 
data for the "basic" layer, a nonuniform PSK code constellation is obtained 
by adding a small additional phase shift a to each original constellation point, 
which contains the information for the "additional" layer. This is illustrated in 
Figure 3-1, when the basic message is conveyed via a QPSK constellation. A 
capable receiver with high SNR can distinguish among all eight constellation 
points which means that both the basic and the additional layer messages can 
be decoded, whereas to a less capable receiver the constellation may appear 
like a blurred QPSK constellation and hence it may only be able to distinguish 
between the different "large" phase shifts; thus only the basic message can be 
detected accurately. By such a construction the error rates associated with the 
basic and the additional message are different. Consequently, the additional 
message can carry information that is of less importance for the reconstruction 
of the transmitted message than the basic message is. The error probability of 
the additional message can be easily adjusted by choosing different values of the 
additional phase shift a. Obviously, a larger a results in a better performance 
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Figure 3-1: A nonuniform 8-PSK constellation obtained from a standard "uni- 
form" QPSK constellation by splitting each original constellation point "o" into a 
pair of new points "x". 



for the additional message at the cost of a deterioration of the performance for 
the basic message. The main virtue of this nonuniform PSK encoding scheme 
is that the signaling envelope is constant while multi-layered messages can be 
transmitted with different and adjustable error performance. 



CHAPTER 4 
SPACE-TIME CODING 

4.1 Error Performance of STBC Systems 

A conventional wireless system with a single transmit and receive antenna 

suffers from a large performance degradation due to the fading of the radio 

channel Fading is the fluctuation in received signal level. It is mainly due to 

changes in the propagation environment, such as weather and moving reflective 

objects /obstructions . 

Fading can be classified as fast fading or slow fading, depending on how 

rapidly the channel impulse response changes. For a fast fading channel, the 

channel impulse response changes rapidly within one symbol duration, which 

means that the coherence time of the channel T c (which is a measure of the time 

duration that the channel impulse response is quasi-static) is smaller than the 

symbol period of the transmitted signal 

T s > T c (4.1) 

For a slow fading channel, the channel impulse response changes very slowly 
compared with one symbol duration, which means that the channel can be seen 
as quasi-static for many symbol durations, that is 

T s < T c (4.2) 

Figure 4-1 shows an example of the fast fading of the gain of a single radio 
channel. The two curves represent two channels encounter independent fast 
fading. We can see that there is a high risk that a single channel may encounter 
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Figure 4-1: Received signal level may fluctuate vastly for a single radio channel. 

a deep fade at a certain time. During deep fade the transmission quality may be 
very poor or the transmission may simply be impossible. 

To counteract the fading radio environment a multiple transmit and receive 
antenna system (or MIMO) is one attractive possibility. The advantage of a 
multiple transmit and receive antenna is not difficult to comprehend intuitively: 
There are more than one channel which are not fully dependent, so the possi- 
bility that all of them encounter a deep fading at the same time is small; thus 
the deep fading problem may be solved. Figure 4-2 shows the total channel gain 
(denote the individual channel gains be h\ and /i 2 , total channel gain is then 
equal to \h x \ 2 + \h 2 ^) of two independent fading channels. It can be seen that the 
risk of the total channel gain encountering a deep fade is largely reduced. 
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Figure 4-2: Total channel gain of two independent fading channels. 



The corresponding encoding scheme for such a multi transmit and receive 
antenna system is called space-time coding. This is an emerging topic that now 
attracts many researchers' attention since its introduction about a decade ago. 

The main virtue of such a multi transmit and receive antenna system is 
that higher diversity order can be achieved. The diversity order can be defined 
implicitly by 

Bit-Error-Rate oc (SNR)~ diversit y (4.3) 

which means that the bit-error-rate decreases as fast as the Signal-to-Noise-Ratio 
(SNR) to the power of minus the diversity order. For a high SNR the advantage 
of a high diversity order can be substantial. 

Consider a MIMO system with n t transmit antennas, n r receive antennas 
(see Figure 4-3), and assume for simplicity that the propagation channel is 
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Figure 4-3: A wireless link with multi transmit and receive antennas. 

linear, time-invariant and frequency flat. Let H be an n r x n t matrix whose 
(m,n)th element contains the channel gain between transmit antenna n and 
receive antenna ra, and suppose that a code matrix X of dimension n t x N is 
taken from a matrix constellation X and transmitted by sending its N columns 
via the n t antennas during N time epochs. If the signal received during the same 
N time intervals is arranged in an n r x N matrix Y, then 



where E is an n r x N matrix of noise. Throughout this thesis, we shall make the 
somewhat standard assumption that the elements of H and E are independent 
and zero- mean complex Gaussian with variances p 2 and a 2 , respectively; hence 
the channel is Rayleigh fading. 

The Shannon channel capacity of such a MIMO system is a well-known 
result, and it can be shown that (for details see [14] chapter 3 and the references 



Y - HX + E 



(4.4) 
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therein) 

C(H) = B • log, | J + \HPH H \ (4.5) 

where B is the bandwidth of the channel in Hz and P is the covariance matrix 
of the sequence of transmitted data vectors 

P = E[x n xZ\ (4.6) 

If the transmitted signals are uncorrelated at different transmit antennas and we 
use unit power at each transmit interval, then 

P=±-I (4.7) 
n t 

(4.5) reduced to 

C(H) = B ■ log 2 1 J + ^HH H \ (4.8) 

Clearly, the channel capacity depends on the eigenvalues of HH H . 

Given Y, detection of X in a maximum-likelihood (ML) sense amounts to 
minimizing the Euclidian distance 

|| Y - HX\\ (4.9) 

with respect to X G X, as well as H (unless it is known). The average (over 
H) probability that the ML detector mistakes a transmitted code matrix X 0 
for an incorrect (and different) code matrix X ^ Xo is called the pairwise error 
probability For completeness a brief derivation of this probability is provided here 
for both coherent and noncoherent detection. 

Coherent: Suppose that X 0 is the true transmitted code matrix, thus 
Y = HX 0 + E. Then the probability that an incorrect decision in favor of a 
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different code matrix X ^ X 0 occurs is 

P(error|H) =P(\\Y - HX\\ 2 < \\Y - HX 0 \\ 2 \H) 

=P{\\HX + E\\ 2 <\\E\\ 2 \H) 

, \ (4-10) 

= P(2ReTr {E H HX} > \\HX\\ 2 \HJ 

. ( \\HXf \ 
^ eXP ( 

where X = X 0 — X, and | • | denotes the determinant of a matrix. In (4.10), 
the second equality follows by some matrix algebra and the last step is a conse- 
quence of the fact that 2ReTr{E*HX} is zero-mean Gaussian with variance 
2a 2 ||iJX|| 2 together with the Chernoff bound. Under the assumptions made, the 
probability density function (p.d.f.) for H is 

P(H) = ^^e^(-±\\H\\ 2 ) (4.11) 

Averaging (4.10) over H using (4.11), we get after some calculations 
P(error) = J dH P(error\H)p(H) 
< (V)- nrnt / dH exp ( - -^WHXW 2 - i||H|| 2 ) 
= (V) - """ J dH exp ( - Tr j H ■ l(£^x" + /) • H H j) 



(4.12) 



2 * — n r nt 



£,xx H + i[ nr < \(X - X 0 )(X - x 0 ) H \- n ' (£,) 

Noncoherent: If H is unknown, we can obtain a detection rule that depends 
only on X. Consider the problem of minimizing (4.9) 

^mrn^ \\Y-HX\\ 2 (4.13) 
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Treating H as a deterministic unknown, we can minimize this likelihood function 
with respect to H. Notice that 

\\Y-HX\\ 2 = \\Y H -X H H H \\ 2 

= \\(U X * -X*H*)\? 

= \\ll xH (Y» - X»H»)\\ 2 + \\Tl xH (Y H ~ X H H")\\ 2 (4 14) 

+ 2 Re (Tr{(Y - HX)U x h ■ U xH (Y H - X H H H )]) 

„ ' 

o 

= \\yi x mY h - x h h h \\ 2 + ||n^H^|| 2 

Here U x h = X H (XX H )~ l X is the orthogonal projector of X H . The function 
in (4.14) is minimized when 

U x hY h -X H H H = 0 
^X H (XX H )- l XY H - X H H H = 0 (4.15) 
*=*X H ((XX H )- l XY H - H H ) = 0 

If X has full rank then the above leads to the result (see, e.g., [15] and [14] 
appendix B) 

H = YX H (XX H )~ l (4.16) 

(It may be argued that if we know some sort of statistical properties of if, 
we should treat H as a stochastic variable when it enters the likelihood function. 
This is intuitive since we should incorporate all known information into the 
detector to get the best result. But in Larsson et al. [16] (see also Larsson et al. 
[14] exercise 9.7) it is shown that both approaches indeed give the same result. In 
this paper we will stick to the deterministic channel approach for simplicity.) 



Inserting (4.16) into (4.13) yields 
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nun \\Y-YX H (XX H )->X\f 

mm \\YU xH f (4.17) 

max ||n x HY^|| 2 



Let 



A n =n X H-U X H (4.18) 

Note that the matrix A n is Hermitian, but not necessarily positive definite. Also 
note that 

AnXg = (U x h - U x h)X^ 

= x 0 "-n X H* 0 " ( 419 ) 

Hence, since Y\^. H X^ has full rank by assumption, the matrix XqYI^hX^ is 
positive definite. 

The probability that the ML detector makes a mistake is 

P(error|H) = p(||n x „Y"|| 2 > \\U x uY"\\ 2 \h) 
=p(lY {(HXo + E)A n (HX 0 + E) H ) < 0|h) 

=p(2ReTr{ J H"X 0 A n £"} 

V ' . (4-20) 

+ Tr{EA n E H } < -Tv{HX 0 A n X»H H }\H) 

=p(2ReTt{HX 0 n x „E H } 

+ Tr{EA n E H } < -\\Tl xH X%H h \\ 2 \h) 

where in the last step we used the fact that Yl xH U. xH = n^ H . 
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The term EA n E H is of higher order and may be neglected in an asymptotic 
analysis. In particular, for a given if, the variance of its elements is of the order 
a 4 , whereas the elements of HXqYI^hE 1 * have a variance of the order a 2 . Since 

dQ(x) 1 ^ 1 . ..... 

V = — F=e 2 < — _ < oo (4.21) 

Thus a small finite error in variance can only caused a small finite error in the 
Q-function also, so the additional small variance caused by this higher order term 
can be neglected. 

By the Chernoff bound we have 

p(2ReTr{HX 0 U^ H E H } < -\\U^ h X^H h \\ 2 \h) 

Hence, by averaging in a way similar to (4.23), we get 
£ H [P(error)] = J dH P(erroT\H)p(H) 
< (V)-"" n ' / dH exp ( - ±\\n± H X»H H r - l||/f|| 2 ) 
= (np 2 )- nrnt J dH exp ( - Tr jff • ^(^X 0 U^ H X^ + I) ■ H ff }) 

= ^-2 X ^x^ + l[ " < \X^ x „X»\-<- ■ (^P nt 

To summarize, the error bounds for coherent and noncoherent detection in a 
STBC system are the following 

E [P(X 0 -* X)\ <\(X- X 0 )(X - X 0 )T nr • (h) nrnt ( 4 24 ) 

coherent ' \4(J Z / 

and 

E[P(X 0 -» X)) < \X^ X „X% \~ n ' ■ (4.25) 

noncoherent V frC7 / 



(4.23) 
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A special subclass of space-time codes is the linear orthogonal space-time 
block codes (OSTBC). The case when OSTBC applied to a 2-TX system turns 
out to be the well-known Alamouti code [17]. 

The main virtue of OSTBC is that they achieve a diversity of order n r n t for 
a system with n t transmit and n r receive antennas, at an almost negligible com- 
putational cost. Specifically, for such a system, a set of n s symbols {51, ...,s n J 
are encoded into &nn t xN matrix that has the following structure 

1 OSTBC — J2(snA n + s* n B n ) (4.26 

71=1 

The columns of Xostbc are transmitted via the n* transmit antennas during N 
time intervals. In (4.26), {A n) B n } are matrices chosen such that 



for all {s n }. In general, the role of the orthogonality condition (4.27) and the 
associated design of {A n) B n } is relatively well understood by now; for instance, 
(4.27) implies that the symbols {s n } can be detected independently of each other 
[14], so with the use of OSTBC, the MIMO channel can be decoupled into n s 
number of single additive-white-gaussian-noise (AWGN) channels (See Figure 
4-4). 



Space-time modulation matrices that can be encoded differentially are 
sometimes of special interest since such codes can easily be demodulated nonco- 
herently. Differential codes with uniform error probabilities have been studied 
by many authors (see, e.g., [18, 19, 20, 21, 22] for some prominent examples) 
and some of them can be seen as an extension of differential PSK to MIMO 
systems. In general, if t is the time index and if {U(t)} is a sequence of (square) 




(4.27) 



n=l 



4.2 Differential Modulation for MIMO Systems 
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Figure 4-4: OSTBC decouples a MIMO channel into n s number of AWGN chan- 
nels. 
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information-bearing matrices, then differential encoding obtains the transmitted 
code matrix X(t) at time t via 



X(t) = X(t - l)U(t) 



(4.28) 



where X(t — 1) is the matrix transmitted at time 4 — 1. Such an encoding 
usually is only meaningful under certain circumstances, for instance if U(t) is 
unitary (in which case X(t) becomes unitary for all t as well, given a unitary 
"initial matrix" X(0)). By considering two (or more) received matrices Y(t) 
and Y(t — 1) simultaneously, noncoherent detection is possible. For example, if 
we concatenate the matrices received at time t — 1 and f , and assume that the 
channel if remains constant over these two time intervals (to within practical 
accuracy), we can write (using (4.4) and (4.28)) 



Y(i-l) Y(t) 



HX(t-l) HX(t) 



+ 



E(t-1) E(t) 



HX(t-l) HX(t-l)U(t) 



+ 



E(t-1) E(t) 



(4.29) 



--HX(t - 1) 



I U(t) 



+ 



E(t-1) E{t) 



where HX(t — 1) can be seen as an unknown "effective" channel matrix and 



I U(t) 



I U(t) 



is such that 



is an effective code matrix. Provided that 
the corresponding determinant in (4.25) is nonzero, noncoherent demodulation 
is possible. By applying the decision rule (4.17) to the data model (4.29), the 



detection rule for differential demodulation becomes 
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nun 



min Tr 
u(t)ex 



min Tr 
u{i)ex 



Y(t-1) Y(t) 
Y(t-1) Y(t) 

Y(t-1) Y(t) 



n- 1 - 



U U u(t)\ H 



Y H (t - 1) 
Y H (t) 



I -U(t) 
-U H (t) I 





Y H (t-l) 


j 




. Y " {t) . 





(4.30) 



« Re { Tr {U(t)Y H (t)Y(t - 1)} } 

where Tr{-} stands for the trace of a matrix and where we used the fact that 
the information-bearing matrix U(t) is unitary. The associated average error 
probability, i.e., the probability that is incorrectly detected as U for this 
differential detection scheme can be bounded by using (4.25) 



E [p(l/(°> -> U) 



< 



I U<® 



n [7 U\» 



I 



( P \- n <- n * 



(4.31) 



i-L(uu<v h + uwu u ) 

2 



For simplicity hereafter the time index t of U(t) is omitted whenever no confu- 
sion can occur. 

4.3 Nonuniform Space-Time Codes 
Inspired by the nonuniform PSK codes discussed in Section 3, our new 
nonuniform space-time codes are based on differential encoding of the product 
of a unitary code matrix U h G X h associated with a basic message, and another 
unitary code matrix C/ a e X & corresponding to an additional message. Thus the 
transmitted code matrix at time t is given by (cf. (4.28)) 



X(t) = X(t - l)U h (t)U & (t) 



(4.32) 
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Using (4.31) along with the fact that UJJ^ is unitary, we find that the 
average probability that a transmitted message pair (t/°,t/°) is mistaken for 
another pair (U h ,U & ) can be bounded by 



E[P(U°,Ul^U b ,UJ] 

2 _ (4-33) 



< 



A bound such as (4.31) or (4.33) was used in, for instance, Hughes et al. [18] 
for the design of (uniform) differential space-time codes. However, although it 
may be thought of as a feasible approach, an attempt to minimize this bound in 
the context of nonuniform space-time modulation may not produce the desired 
result since the target error rates for U h and l/ a are different. 
4.3.1 Design Criteria 

Since the additional message can be decoded only at high SNR, the matrices 
{C/ a } associated with the additional message should approximately be close to 
the identity matrix. Inspired by this observation, we can first consider the design 
of {t7 b }, treating the presence of ?7 a as an unmodelled noise-like disturbance 
term. Doing so, we can approximately bound the error probability for U h alone 
(assuming differential detection) by 

£ [P(£7» - t\)] 



r-;(Bftf + ^ -(S^P (434, 



where a 2 is a factor that incorporates the noise-like effect of the presence of t/ a . 
Although the "bound" (4.34) is somewhat heuristic (and probably neither tight 
nor very accurate), we believe that it may serve a purpose as a meaningful design 
criterion for {U h }. 

Next, for the design of {U & } we proceed as follows. Suppose that the SNR 
is in a region such that U h can be reliably decoded. For the design of {U & } this 
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should be a reasonable assumption, since if it is not true then decoding of U & is 
probably of less interest anyway. Assuming that U h is known, the demodulation 
of U a is essentially another noncoherent detection problem. To obtain a criteria 
for the design of the constellation {E/ a }, we want to form an error bound on £/ a 
and average it over all possible basic messages {U h }. Using the bound (4.25) for 
noncoherent detection appears to yield criteria that are very difficult to use. As 
a suboptimal approach we used instead the bound (4.24) for coherent detection. 
Doing so results in the following criterion 

e [P{ul - t/j] <p^ E I - tW(tfbtf? - u h u m )«r- ■ (£^P H1 

=\{ui-u^ui-u>rr\-^y nrnt 

(4.35) 

where in the last step U b disappears since it is unitary. In (4.35), | ■ | denotes the 

number of elements of the set. 

4.3.2 Union Bound on the Error Probability 

By the union bound, the probability for the basic message to be in error 
(averaged over all possible pairs of additional messages) can be bounded by 

E [P(basic message in error)] 

— \x I \x I 2 Z_-/ 

kjtn 

- \ (yi*u*)u?)*uP" + uPu?uWuP H ) ^ ■ (^) ~ nrnt 

(4.36) 



Likewise, the error rate for the additional message (averaged over all possible 
pairs of basic messages), can be bounded by a similar expression 

E [^(additional message in error)] 
.1 v — ^ 1 
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j _ l(u[ k) u^u^ H u^ )H + u[ n) uyu? H ui k)H ) 



( p \~" rni 



(4.37) 



These bounds will be useful for performance optimization. 

4.4 Design Examples 
4.4.1 ^ = 1,^ = 1 Code for 2 TX Antennas 

We first construct a code where the rate for the basic message is R h — 1 
bit/sec/Hz and the rate for the additional message is i? a = 1 bit/sec/Hz as well. 
We take the basic message U h taken from the following set 



1 0 




0 -1 




0 1 




0 1 


> 


1 0 




-1 0 





-1 0 
0 -1 



(4.38) 



The constellation in (4,38), which is uniform and possesses certain optimality 
properties, is due to Hughes et al. [18] and was called "BPSK" therein. Based on 
the design rules in Section 4.3.1, we have handcrafted the following constellation 
of matrices for the additional message t/ a 









0 






0 




g-«T 7 


- 

0 


0 


5 


0 




) 


0 




> 


0 





} 



(4.39) 



where (A, 7) are design parameters (to be discussed below). 
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The constellation may appear to be somewhat arbitrary but it possesses 
some nice features. First, all matrices in (4.39) are unitary, this is a requirement 
for MIMO differential encoding. Second, it is symmetric and uniform which 
means that the error probability performance should be the same for all constel- 
lation points. Also similarly to the nonuniform PSK encoding for a single channel 
(cf. Section ??), multiplication with the matrices in (4.39) can be interpreted as 
adding a small phase shift to the elements of the basic message U h . 

We may also think of other forms of matrices as the additional message. For 
instance, consider the following anti-diagonal matrix 



0 e i,rA 
e <W7 0 



(4.40) 



But we immediately find that this does not work. Consider, for example, 



1 0 
0 1 



(4.41) 



u h -u m = 



0 e 



(4.42) 



Then, 

10 0 e 
0 1 ^ 0 

It is seen that multiplying this additional message t/ a not only introduces a small 
phase shift but also interchanges the columns of the original basic message U h . 
This is not acceptable since our design goal is to have the additional message 
only pose a small disturbance to the basic message so that for those less-capable 
receivers can still detect the basic message even without knowledge of the 
existence of the additional message. The U & here simply destroys the original 
basic message constellation so it is not feasible. 
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Let us now return to our design task. If we take the initial transmit matrix, 



somewhat arbitrarily 

i 1 1 

(4.43) 



1 1 
-1 1 



The factor ^= is inserted to normalize the transmitted power to unity for each 
time interval It follows that 

X H (t)X(t) = I (4.44) 

for all t and hence the differential encoding of the code is meaningful. Further- 
more, we can verify that all elements of X have always constant magnitude: 

\X k:l (t)\ = ^= (4.45) 

and hence all transmitted symbols are obtained by constant envelope modulation, 
which was one of our design goals. 

The error performance (of both the basic and the additional message) will 
depend on A and 7, and typically the error performance of the additional message 
can be traded against that of the basic message. The values (A/y) must be chosen 
carefully. For instance, at first glance intuition would perhaps suggest us to take 
A > 0 and 7 = 0, but we can show that such a "simple" choice leads to a code 
that "works" but that does not provide maximal diversity. 

In our experiments, we used the union bound (4.36) and (4.37) to optimize 
over (A, 7). In particular, for a given "tolerable" loss in performance for the basic 
message, we can find the pair (A, 7) that minimizes (4.37). The result is shown in 
Figure 4-5. For example, if we can accept a degradation of 1.5 dB for the basic 
message, then A = 0.2 and 7 = 0.035 are optimal. The optimization over (A, 7) is 
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Figure 4-5: Optimal values of (A, 7) for the R h = 1, R A = 1 code, for a given ac- 
ceptable performance degradation of the basic message. The results are obtained 
via minimization of the union bound (4.36) and a corresponding expression for 
the error rate of the additional message. 

further illustrated in Figure 4-6, where we show how the performance associated 
with the basic and additional messages varies with (A, 7). 

Figure 4-7 shows the empirical bit-error-rate (BER), obtained via Monte- 
Carlo simulation, for the code described above using A = 0.2, 7 = 0.035 and ML 
decoding. The solid lines (" — ") show the performance of differential nonuniform 
BPSK for a conventional system with n t — 1 transmit antenna and a single 
receive antenna (this is essentially a special case of Pursley et al. [8]). The 
dashed lines ("- -") show the performance for a system with n t = 2 (and a single 
receive antenna) using the new code presented above. For the curves without 
marks, only a basic message is transmitted. The curves with marks show the 
performance when both a basic and an additional message are transmitted: the 
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Figure 4-6: Performance degradation for the basic message and performance gain 
for the additional message for the R h = 1, i? a = 1 code. The results are obtained 
via (4.36), along with a corresponding expression for the additional message. The 
curves for the basic message are normalized relative to the "undisturbed case 5 ' 
(A = 7 = 0) and the curves for the additional message are normalized relative to 
A - 0.2, 7 - 0.035. 

curves marked with "o" show the BER for the basic message, and the curves 
with "x" show the BER for the additional message. Clearly, the transmit 
diversity system outperforms the conventional one - observe, in particular, the 
different slopes of the BER curves both for the basic and for the additional 
message. The simulation also confirms that the transmission of an additional 
message incurs a small performance degradation for the basic message. 
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Signal-to-noise ratio [dB] 

Figure 4-7: Empirical BER for the R b = 1, i?, = 1 code with A = 0.2, 7 = 0.035. 



4.4.2 R b = 2, = 1 Code 

To obtain a code with a higher information rate for the basic message, we 
next take U b from the following algebraic group of 16 matrices (which is due to 
Hughes et al. [18] as well) generated by 
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e W4 o 




0 -1 


0 e~ jn/4 




1 0 



(4.46) 



The notation in (4.46) means that all possible matrices can be generated by 
choosing any arbitrary integers M and N in the following expression 

n N 



gJTT/4 0 


M 


0 -1 


0 e~i n/A 




1 0 



(4.47) 



The constellation used for the additional message is chosen to be the same as 
(4.39), so we now have a system with rate R h — 2 for the basic message and rate 
i? a — 1 for the additional message. The corresponding simulated BER is shown in 
Figure 4-8. 

4.4.3 Receive Diversity 

To demonstrate the transmit diversity achieved for the 2-TX system in the 
simplest way, our results above are based on the 1-RX case. Although of this, our 
newly proposed nonuniform STBC codes are not only useful for 1-RX case but 
also for a general n r -RX case. This is obvious since our theoretical derivations 
above are not tailored for the 1-RX case, hence it should work automatically 
for a general n r -RX case. Also we could expect a diversity order of n T n t can be 
achieved. This is so since we can see the data model of a n r -RX n r TX system as 
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=1, DQPSK (basic msg. alone) 

=1, DQPSK (basic msg.) 

=1,BPSK (add. msg.) 
2, Code (4.46) (basic msg. alone) 
2, Code (4.46)-(4.39) (basic msg.) 
2, Code (4.46)-(4.39) (add. msg.) 
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Figure 4-8: Empirical BER for the R b = 2, i? a = 1 nonuniform space-time con- 
stellation. The parameters were A = 0.078, 7 = 0.018 (found via optimization of 
(4.36)). 

n r different 1-RX n t -TX system 

Y = HX + E 



















x + 












3v_ 



(4.48) 



Yx =H 1 X + E 1 
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where Yk for k = 1, • • ■ ,n r is the 1 x TV received data at the kth receive antenna, 
Hk is the 1 x n t channel matrix associated with the kth receive antenna, Ek is 
the 1 x n t noise matrix at the kth receive antenna. 

As usual, for simplicity if we assume the channel matrices associated with 
n r different receive antennas if* (for fc = 1, • • • , n r ) are independent and the 
noise is white (E u - • • , E Ur are uncorrected), then the n r -RX n r TX can be 
broken down into n r number of independent 1-RX n r TX system. Each receive 
antenna has sufficient data to detect the transmitted matrix X so they can work 
individually and achieve transmit diversity of order n £ . To further utilize receive 
diversity we can employ joint ML detection across all n T independent receive 
antennas thus increasing the overall diversity by a factor of n r . Since the n r 
receive antennas are independent at the receiver site, so optimizing the space- 
time encoding scheme for a single receive antenna is equivalent to optimizing a 
system with n r independent receive antennas. 

4.5 Suboptimal Detector for Nonuniform D-STBC 

To detect both the basic and the additional messages in a joint ML fashion 
we have to search through all \X h \ • \X^\ possible combinations to find the 
combination of U h and C/ a which maximize the cost function in (4.30). This may 
be computationally burdensome if the constellation sizes of both layers are large. 
A possible remedy to this problem is to use a suboptimal detection rule described 
as follows 

Step 1. Detect the basic message only using (4.17), neglecting the presence 
of the additional message 

2 

Y[t-l) Y(t) 



UJt) = min 
c/ b (f)e* b 



• n (/ u b (t))n 



max Re {Tr {U b (t)Y H (t)Y(t - 1)}} 



(4.49) 
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Step 2. Detect the additional message assuming the decision taken in Step 1 
is correct 



mm 



Y(t-1)-U b (t) Y(t) 



(4.50) 



Let us elaborate more on why this suboptimal detection scheme works. Since 
C/ a is close to the identity matrix, when compared to the basic message U b it can 
be treated as small unmodelled noise; hence the detection rule in Step 1 can still 
be able to detect the basic message with only a small performance degradation. 
As the error performance of the basic message is typically superior to that of the 
additional message, we can assume the detected basic message is correct when we 
are dealing with the detection of the additional message. Thus the observation 
model for the detection of U^(t) becomes 



Y(t-1)-U b {t) Y(t) 



= HX(t - 1) 



U b (t) U h (t)U>(t) 



+ 



= HX(t - l)U b (t) 



I U>(t) 



+ 



E(t-1) E(t) 
E(t-1) E(t) 



(4.51) 



Treating HX(t — l)U h (t) as an unknown "effective" channel as in (4.29) and 
using (4.17) and (4.30) yields the detection rule in Step 2. 

By applying this suboptimal detection rule the computational complexity 
is reduced from \X h \ • \X & \ to \X h \ + \X A \. For example, for the R h = 2, Z? a = 1 
code, the computational time is reduced from |# b | • |A" a | — 16-4 = 64 to 
\Xb \ + |<*a| = 16 + 4 = 20. For this example the reduction is not very significant; 
yet this suboptimal detection scheme does provide more flexibility as opposed 



to the joint ML detection because if only the basic message is needed, the 
computational load will be further reduced to \X h \ = 16 (for this example). 

We expect that there is some performance loss associated with this subopti- 
mal detection scheme since it does not exploit knowledge of the structure of the 
additional message in the Step 1. 

The Monte- Carlo simulation result in Figure 4-9 shows the performance 
of the suboptimal detector for the R h = 2, R A = 1 code compared to the 
performance of the joint ML detector. The detection of the basic message gives 
a small performance loss 0.3 dB) compared to the joint ML detector, whereas 
the detection of the additional message is (to within practical accuracy) equal to 
that of the joint ML detector. 

4.6 Comparison of Differential Group Codes and A Differential Alamouti Code 
4.6.1 Differential Alamouti Code 

The main advantage of the nonuniform space-time group codes introduced in 
Section 4.4 is that the signal envelope is constant. However, also for this reason 
they cannot achieve the maximum possible performance (in terms of BER) 
because they do not make use of any amplitude information in the signal. On 
the contrary, orthogonal space-time block codes (OSTBC) should provide better 
performance (BER) since they are already optimized by construction [17, 23, 14]. 
For 2 TX antennas OSTBC reduces to Alamouti code. The construction of an 
Alamouti code matrix is as follows 



Si(t) s* 2 (t) 
s 2 (t) -s\(t) 



(4.52) 



where S\ and s 2 are the information bearing symbols and (•)* stands for the 
complex conjugate. For the Rb = 1 system, s\ and $2 are taken from the BPSK 
set {—1,1}, and for the Rb = 2 system, they are taken from the QPSK set 



{— 1, -j, 1, j}. The l/\/2 factor is included for normalization purposes so that 
the transmitted power per time interval equals one. This Alamouti matrix U(t) 
can then be encoded differentially by using (4.28) in Section 4.2. Note that the 
Alamouti code works together with the additional message E/ a taken from (4.39) 
to form a nonuniform constellation, and differentially encoded by (4.32), but 
unfortunately the matrices in the so-obtained constellation do not have constant 
signal envelope. 

Here we compare the performance of the group codes to that of the Alam- 
outi code. The R h = 1 group codes (4.38) in Section 4.4 and Rb = 1 Alamouti 
codes are indeed equivalent, so there is no performance advantage for Alamouti 
codes for this case. The equivalence of both codes is easy to show as in this case 
the set of Alamouti matrices become 
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(4.53) 



* ^ then we obtain the Rb = 1 group 
code in (4.38). 

For the Rb — 2 codes, the Alamouti code does provide somewhat better 
BER performance than the group codes in (4.46) and (4.39). This is so since 
the Alamouti code uses both signal amplitude and phase to transmit informa- 
tion; thus it should be less susceptible to the phase disturbance caused by the 
additional message. Figure 4-10 shows the simulated performance difference 
between the R h = 2, R & = I nonuniform group codes in (4.46) and (4.39) and 
a R h = 2 Alamouti code together with the additional message (4.39). For the 
basic message the Alamouti code outperforms the group code by about ldB, and 
it is somewhat more robust when additional message is added. Meanwhile the 



If we pre-multiply the above set by 4j 
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performances of the detection of the additional message axe equal both when the 
Alamouti and the group codes (4.39) are used as the basic message carrier. 
4.6.2 Alamouti Code with Differentially Encoded Symbols 

It may be argued that both the constant signaling envelope and maximum 
BER performance can be achieved at the same time by using symbol-wise 
differential encoding and forming the Alamouti matrix afterwards. However, 
unfortunately this encoding scheme does not work. This is demonstrated as 
follows. 

Suppose that we employ symbol-wise differential encoding 



Xi(i)=*i(t-l)-*i(t) 
x 2 (t) = x 2 (t-l)-s 2 (t) 

and use these elements to form the Alamouti matrix 



(4.54) 



X(t-l) = 



X(t) = 



x x (t - 1) a?S(t - 1) 
x 2 (t-l) -xl(t-l) 



xi(t) x* 2 (t) 
x 2 (t) -xftt) 



Xi(t-l)Si(t) x* 2 (t - l)s* 2 (t) 

x 2 {t-i)8 2 (t) -*;(*- i)«;(t) 



(4.55) 



The above code matrix has the property that \xi(t)\ = \x2{t)\ = \s\(t)\ = \s 2 (t)\ = 
1 for all t and hence the signal envelope is constant for both transmit antennas 
all the time. Also X(t) is unitary for all t since it is an Alamouti matrix. The 
observation model (4.29) becomes (cf. Section 4.2) 



y(t-i) Y(t) 



H 



X(t-1) X(t) 



+ 



E(t-1) E(t) 



(4.56) 



=HX(t - 1) 



/ X"(t-l)X(t) 



+ 



E(t-1) E(t) 



Employing the noncoherent detection rule (4.30 to the above model (4.56) the 
following detection rule is obtained 
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max Re{ Tr {(X H (t - l)X(t)Y H (t)Y(t - 1))} } 



max 



Re 



\ \[ X2 {t-\) - Xl (t-1) 



xi(t-i).«i(0 x;(t-i)-s;(o 
x 2 (t- 1). 52 (0 -*;(* - 1) - *y(0. 



V H (t)V(t - 1) 



}} 



max 
*l(0,*2<«)€S 



Re 



Tr | I" *i(0 + s 2 (t) x* x (t - l)x* 2 (t - 1) • (a5(t) - ^(0)1 r jf (0y(£ _ 1} 1 ) 



(4.57) 



The above maximization problem (4.57) cannot be solved unless Xi(t — 1) and 
x 2 (t — 1) are known. For example, consider the following two sets of values 



{si(t),s 2 (t),x 1 (t - l)x 2 {t - 1)} - {1, -1, 1} 



and 



{si(t),s 2 (t),x l (t - l)x 2 (t - 1)} - {-1, 1, -1} 
These make the metric in (4.57) equal to 

Y H (t)Y(t - 1) 



(4.58) 



(4.59) 
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| Tr< 










2 0 



(4.60) 



which implies that the sets {51,52} = {1, —1} and {s i5 s 2 } = {—1, 1} are not 
identifiable if X\(t — 1) and x 2 (t — 1) are unknown. 
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Figure 4-9: Monte-Carlo Simulation of the BER performance for the suboptimal 
receiver (4.49) and (4.50) in Section 4.5, for the joint ML receiver and for the 
basic message only system with R b = 2, i? a = 1 code. The curves "— x — " and 
"- x — " overlap. 
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Figure 4-10: Monte Carlo Simulation result for the BER performance of R b = 2, 
Zi a = 1 system, with differential group codes and differential Alamouti codes. 
Note that the solid and dash curves with "x" marks overlap. 



CHAPTER 5 
CONVOLUTIONAL PRE-CODING 

5.1 Introduction to Convolutional Codes 
In Pursley et al. [12], convolutional coding are introduced for extra error 
protection before the data symbols are further encoded by the nonuniform 
PSK modulation. For completeness we introduce here the basic concept of 
convolutional coding as a tool to improve the BER performance. 

Convolutional coding is a kind of forward error correction (FEC) technique, 
which means that no feedback channel is required in contrast to, e.g. Automatic 
Repeat Request (ARQ). 

The idea of convolutional coding is simple: To improve the error perfor- 
mance of a transmission link by adding some carefully designed redundant bits 
to the data before it is transmitted through the channel. This process of adding 
redundancy is known as channel coding. In contrast to block codes which operate 
on large message blocks, convolutional codes operate on a continuous stream of 
data bits. 

Convolutional coding is a FEC technique which is especially suitable to 
channels which are contaminated by additive white gaussian noise (AWGN). By 
using convolutional coding, BER performance can be improved significantly at 
the same SNR, at the cost of lowering the data rate. Meanwhile this rate loss can 
be compensated for by increasing the constellation size. 

There are two important parameters for a particular convolutional codes, 
namely the code rate and the constraint length. The code rate, fc/n, is simply the 
ratio of the number of data bits input to the convolutional encoder (k) to the 
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First output 




Second output 

Figure 5-1: A simple rate 1/2 convolutional encoder. 

number of output bits by the convolutional encoder (n), can also be a measure 
of the efficiency of the code. The constraint length, K, denotes the "length" or 
"memory" of the convolutional encoder, that is, each output bit is affected by K 
previous input bits. 

For years, convolutional coding has been the predominant FEC technique. 
But there is a tradeoff of employing convolutional coding. For example, using 
a 1/2 rate convolutional codes, the data rate will be deducted to half, provided 
that the modulation technique is the same. This is simply because with rate 1/2 
convolutional encoding, two channel symbols are needed to be transmitted per 
one data bit. But the advantage is that, the convolutional coding can improve 
the error performance at the same SNR, at the cost that the data rate will 
decrease by a factor oin/k. 

Figure 5-1 shows a typical rate 1/2 convolutional encoder, built with two 
shift registers and two modulo- 2 adders. 



One should pay attention that an interleaver is usually used to interleave 
the encoded data bit stream from the output of the convolutional encoder. 
The benefit of interleaving the data bits is that it can provide time diversity: 
consecutive bits are likely to be contaminated by a temporary strong interference 
or noise, or sustained by deep fading channel environment, this can make the 
error correcting algorithm at the convolutional decoder fails to correct bit errors 
and recover the correct information. With sufficient interleaving, the originally 
consecutive bits are now separated far apart as they are being transmitted 
through the wireless channel, so that we can reasonably assume that the bits 
nearby each other after de-interleaving have undergone independent fading 
and are affected by independent noise and interference, this is desirable to the 
convolutional decoder as consecutive bits are not likely to encounter errors 
simultaneously. Also the design task of the ML detector at the receiver is also 
simplified. 

5.2 Convolutional Codes Applied to Space-Time Coding System 
We are going to investigate the error performance of the system if a con- 
volutional encoder is employed to the information bit streams prior to STBC 
mapping. It is expected that the BER performance can be improved substan- 
tially by such kind of pre-coding in an adverse channel environment. This simply 
trades the error performance with data rate. 

To start with, we choose a simple rate-1/2 convolutional encoder with 
constraint length if — 3, generators are 5, 7 in octal. 

As for our R\, = 1, R a = 1 code, there are two bits carried in the basic layer 
and two bits carried in the additional layer for each transmit time interval, let 
Si, 5 2 denote the two bits carried in the basic layer and S3, s 4 denote the twos 
bits carried in the additional layer, respectively. And now we have four separate 



(independent) bit streams, for example they may look like the following 

51 = 0, 1,1,0, 1,1, 0,1,1,. 

5 2 = 1,1,0, 1,1,0,0,0,1,-" 

(5.1) 

s 3 = 0, 0,0,1, 0,1, 1,1,1, 

5 4 = 0, 0,1, 0,0, 1,0,1, 0,--- 

We model the 5 1, 52, 53, 5 4 as independent bit streams with random 0 and 1. For 
each bit stream we pass it through a stand-alone binary convolutional encoder 
stated above. After that, we would have four encoded bit streams, denoted by 
5 i> s 2) 5 3> s 4- Notice that the encoded streams are not of the length of the raw 
data streams, for the rate-1/2 encoder used in our study here, the length of the 
encoded streams is doubled, we sacrifice the data rate by half to trade for better 
BER performance. 

The four encoded bit streams are then being passed through the STBC 
mapper as before and sent via the MIMO system. 

5.2.1 Hard-Decision Decoding 

We get the estimated bit streams sY, $Y, sY> $4 at the output of the 
hard-decision symbol detector, we then pass them through the Viterbi decoder 
to decode them back into the raw data streams. BER performance of such a 
hard-decision decoding system is shown comparing to the un-encoded system. 

5.2.2 Soft-Decision Decoding 

For a soft-decision decoder, the input is no longer a precise 0 or 1 binary 
estimation but the likelihood ratio P(c = l)/P(c — 0) of each bits. 

To calculate the likelihood ratio of each individual bit we have to consider 
the receiver structure. For the differential encoding and decoding system in 
section 4.2, to decode the differentially encoded information message, the receiver 



has to consider two consecutive received data blocks 
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Y(t-l) Y(t) 



= HX{t-l) ■ 

y v ' 

effective channel H c 



I U(t) 



E(t-1) E(t) 



(5.2) 



Y e U e Ee 

where Y e and U e stand for the "effective" received data block and the trans- 
mitted matrix, respectively. To detect the transmitted matrix U e given Y e , the 
noncoherent detection technique discussed in section 4.2 can be used. Though 
there is one main difference: For hard-decision decoding in section 4.2 we do 
not need to know the exact value of the likelihood function, but simply choose 
the U e to maximize it. For this purpose we only need to concentrate the func- 
tion against the unknown channel, and simply treat the noise variance a 2 as an 
unknown constant would be good enough. 

However for soft-decision decoding we have to not only maximize the 
likelihood function against U e , but have to evaluate the exact values of the 
likelihood function for any possible transmitted matrix U e) to achieve this we 
have to concentrate the likelihood function against both the unknown channel H 
and the unknown noise variance a 2 . The logarithm of the likelihood function is 
the following [16] 

L(Y e \U e) H ei a 2 ) = -2yVn r log<7 2 - ^Tr{(Y e - H e U e )(Y e - H e U e ) H ) (5.3) 



Maximization of (5.3) with respect to a 2 requires that 
dL{Y e \U e ,H e ,a 2 ) 



da 2 



-0 



-2Nn r ^ + -^Tr {(Y e - H e U e )(Y e - H e U e ) H } = 0 



(5.4) 



which yields a positive solution for a 2 



(5.5) 
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Inserting (5.5) into (5.3) gives 

L x {Y e \U e , H) = - log TV {(Y e - H e U e )(Y e - H e U e ) H ) (5.6) 

We next continue to concentrate the likelihood function to eliminate H e . 
Maximizing (5.6) with respect to H e , we get 

H e = Y e U?(U e U?y l = ±Y e U? (5.7) 

where we used the fact that U e U^ = 21. 

Insertion of (5.7) into (5.6) gives the concentrated likelihood function 



(5.8) 



L(Y e \U e ) = -logTY |(r e - ±Y e U?U e )(Y e - ±Y e Uj?U e ) H } 

= -logTY{y e rf - ±Y e U?U e Y? } 

For a particular information bit, say s\ } the likelihood ratio can then be 
approximated by 

P(Si = 1) Sl/ e : 5 i = i eX P (W.)) 
P(3l = 0) "E^ 0 exp(L(y e |^ 

__ ^U e :si=l 'lL{Y e Y?-(Y e U?U e Y? } 
Et/ e : 5l =0 Tr{Y e Y?-lY e U?U e Y?} 

We can thus evaluate this likelihood ratio for each Si, 52, 53, 54 bits for every 
time intervals and use this soft information bit estimation as the input of the 
soft-decision Viterbi decoder. 

Alternatively, the receiver may use the previously received data blocks to 
estimate the noise variance. Doing so we just need to insert the estimated noise 
variance into the likelihood function, and it is not necessary to concentrate the 
likelihood function with respect to the noise variance. If we estimate the noise 
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variance for a long enough time and take the average value, we will hopefully 
get a very accurate estimation. To get the estimated a 1 from the received data 
blocks we simply need to refer back to (5.5) 

a 2 = ^-Tr {(Y e - H e U e )(Y e - H e U e ) H ) (5.10) 

Substituting (5.7) into (5.10) gives 

* ! = 2k* { (Y ' - krtffiir- " \ Y ^) a } (5 u) 

Substituting the a 2 and H e = \YJJ^ back into (5.3) yields the likelihood 
function with estimated noise variance 

L(Y e \U e ,a 2 ) = -2Nn r log* 2 - ±Tr jy.yf - ±Y e U*U e Y?} (5.12) 

With an accurate a 2 we may expect that this likelihood function performs 
better than the concentrated function with unknown noise variance since there 
is one less unknown variable. Figure 5-2 shows the simulation results of the 
performance of both hard-decision and soft-decision schemes. 
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Figure 5-2: With rate-1/2 convolutional coding, comparison of BER performance 
are shown for hard-decision detection, soft-decision detection with concentrated 
likelihood function, and soft-decision with estimated noise variance. 



CHAPTER 6 
A NETWORK APPLICATION EXAMPLE 

We consider employing our nonuniform space-time codes in a broadcasting 
telecommunication system. First we present a short introduction to contempo- 
rary typical wireless cellular systems. 

The concept of a wireless cellular system is fairly simple: a large region to be 
served axe divided into many small areas called "cells" . Each cell is serviced by 
one or more base station(s) located at the center or the corners of the cell. The 
advantage of such a cellular system is that the available frequencies band can be 
reused in cells which are far apart; thus the same available frequency band can 
now be enjoyed by a small area which may consist of several small cells. 

Figure 6-1 shows a 7-cell frequency reuse scheme. A cluster is formed 
by seven cells. A cluster can use all the available bandwidth, and the whole 
frequency band is being reused in all other clusters. So the whole frequency band 
is now shared by seven cells only, thus each user in the cluster should be able 
to get a adequate amount of usable bandwidth. Since the adjacent clusters use 
exactly the same frequency band, a power control scheme can be employed to 
keep the Signal-to-Interference-Ratio (SIR) under certain threshold. 

Now we are moving on to see what is happening on one particular cell. 
Figure 6-2 shows a typical cell. For simplicity we assume the shape of the cell to 
be circular, and that the base station is located at the center. Let R 0 represents 
the cell radius. Typically the SNR at the cell border should be the lowest. To 
predict the signal strength at various places of the cell, some sort of propagation 
model has to be used. Here we first introduce the free space propagation model 
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Figure 6-1: 7-cell frequency reuse system. 

due to its simplicity. Assuming there exists a Line-Of-Sight (LOS) propagation 
path, then the received signal strength can be predicted by 

d /j\ PtG t G r X 2 ( . 

PM) = 15ov" (61) 

where P t is the transmit power at the transmit antenna, G t and G r are the 
transmit and receive antenna gains respectively, A is the wavelength of the 
transmitted signal, d is the propagation distance. In the previous discussion of 
space-time coding it is assumed that we have a frequency-flat channel, this means 
that we must use a narrowband modulated signal. 

It is well known that the received signal strength attenuates as d~ 2 in free 
space theoretically, but in practical environment for a wireless link, the LOS 
path usually not exists and the propagation may encounter various loss such as 
scattering and other obstructions, so the attenuation is usually much higher. 
Even with a LOS path, the received signal strength usually attenuates faster 
than d~ 2 due to destructive reflection. Another commonly used model is the 



Figure 6-2: Typical cell layout for a wireless telecommunication system. 
log-distance path loss model 



Here P 0 is the received power at a reference point, do is the distance between the 
transmitter and the reference point, and n is the path loss exponent. For ideal 
free space, n = 2. For an urban area we may take n « 3 or 4 [24]. 

Due to the unpredictability of the radio environment the above model can 
only represent the average received power. In fact the actual received power 
fluctuates vastly around the average value since the propagation environment 
varies at different time/space (varying reflective objects, obstructions, weather, 
etc). From a system design point of view the exact level of received power is 
quite hard to predict and this sort of power fluctuation may be modelled as a 
random quantity in a mathematical framework. So the Log-normal Fading model 
is being introduced 




(6.2) 




to 



(6.3) 
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where X a is a zero-mean Gaussian distributed random variable with variance a 2 
(Typical « 6 dB over large/small scale fading). 

Thus we may say that for any point at the cell, the instantaneous SNR is a 
Gaussian random variable with the mean value given by the Log Distance Path 
Loss Model. 



Since now the SNR at any point is modelled as a random variable then the 
instantaneous BER performance is also random, we can define the coverage area 
of a cell 

Define the coverage radius to be the radius of the area such that a specified 
detection performance (in terms of BER) is achievable with a certain probability 
that we call the Quality of Service (QoS). For example, if we let Rq denote 
the 99% coverage radius then within the area of radius Ro the message can be 
detected at a specified BER during 99% the of total time. 

If nonuniform space-time codes are used to increase the data rate for the 
more capable receivers, it is clear that the coverage radius for the transmission of 
the basic message under the same QoS will decrease to i?& < Rq. Meanwhile the 
coverage radius of the transmission of the additional message now becomes R a ] 
of course its value depends on QoS for the additional message. For the simplicity 
of our analysis here let us assume the same QoS for both the basic and the 
additional message. 

Denote the "tolerable" performance loss for the detection of the basic 
message as A dB. Assuming log-distance fading it can be shown that 



SNR = SNR + X a 



(6.4) 




(6.5) 



|§ = Mr** (6.6) 

where T is the performance difference (in dB) between the detection of the 
additional message and that of the standard uniform constellation system. The 
parameter n is the path loss exponent for the radio link environment (i.e. Power 
oc (distance)" 71 ; for details see Rappaport et al. [24]). The above equations 
are also valid in a log-normal fading model. It is not difficult to obtain the 
result above: Denote the SNR required to have a certain BER performance as 
SNR th [dB}. Then, for example, the 99% coverage radius Rq can be determined 
as it satisfies 

P[P^Ro)[dB] + X sigma > SNR ih [dB}] = 99% (6.7) 

When we apply the nonuniform modulation the BER performance of the basic 
message deteriorates by A dB, which means that it requires aAd5 higher SNR 
to obtain the same performance, so 

P[FUJikj[dB] + X sigma > SNR ih [dB] + A] - 99% (6.8) 

Comparing (6.7) and (6.8) we get 

P^Rtj [dB] = P^Ro) [dB] + A 

E (6.9) 
=>101og 10 P 0 [dB] - lOn log{-±) = 101og 10 P 0 [dB] + A 

and (6.5) follows immediately, (6.6) can be shown similarly. 

Figure 6-3 shows how i?& and R a vary with A for the R h = 1, i? a = 1 
system, with the power attenuation factor of the radio link taken to be n = 2 and 
n = 4; these are typical values for free-space and downtown areas, respectively 
[24] . It can be seen that employing the nonuniform constellation does not 
increase the total throughput of the system (i.e., the sum of the data rate for all 
users) in the case n = 2. For example, if the coverage area for the basic message 
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Figure 6-3: Coverage area for the basic/additional messages as a function of the 
"tolerable" performance loss, for the R b = 1, R & = 1 code. The horizontal curves 
represent ,the relative coverage area for the R h = 2 code with a basic message 
only. 

is allowed to decrease to 0.8 times of that of the original cell area, the coverage 
area of the additional message can only reach 0.15 times of that of the original 
cell area, which results in a decrease in overall system throughput. However for 
the case n — 4, the use of a nonuniform code increases the total throughput. This 
observation is not hard to understand; for n — 4 the area closely surrounding 
the base station enjoys a relatively much higher SNR than the area close to the 
border of the cell and hence the coverage area for the additional can be extended 
further. Note that the coverage areas of the nonuniform layered code (4.38) and 
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Figure 6-4: Coverage area for the SISO basic/additional messages as a function 
of the "tolerable" performance loss. The horizontal curves represent the relative 
coverage area for the SISO rate-2 code with a basic message only. 

(4.39) (with R b = R a = 1) tend to approach that of the R b = 2 code (4.46) when 
A increases. 

The performance advantage of a 2-TX system upon 1-TX system is illus- 
trated in Figure 6-4. The coverage area of the 1-TX system is only about 0.2 
times as that of a 2-TX system for n = 4. 



CHAPTER 7 

CONCLUDING REMARKS AND FUTURE WORKS 
We have presented new nonuniform space-time codes that can be encoded 
and detected differentially, and that are based entirely on phase-shift keying. 
We also discussed analytical criteria for code construction and optimization, 
and we compared its performance with that of a scheme based on the Alamouti 
code. We also studied a suboptimal detector and its performance, which we 
found to be satisfactory (a loss within 0.3 dB compared to joint ML decoding). 
We also demonstrated how nonuniform space-time codes in a broadcasting 
telecommunication system can increase the total throughput. 

It may be argued that using already established nonuniform constellations 
for single transmit antenna systems together with, for instance, known linear 
space-time codes should be a natural approach to the problem of designing 
nonuniform space-time constellations. However, there are at least two problems 
associated with such an approach. First, it may in general not be optimal, 
simply because we are optimizing over the class of space-time codes and the class 
of nonuniform single-antenna constellations separately, instead of optimizing over 
the class of nonuniform MIMO constellations. Second, we found it difficult to 
incorporate constraints (such as constant envelope after differential encoding), 
that are desired or required from a practical implementation point of view. 
Therefore, we believe that it may be advantageous to design nonuniform space- 
time constellations by approaching the problem from first principles. 

In this thesis, the nonuniform space- time block codes are chosen in quite 
an "arbitrary" way, and they are optimized with the help of chernoff bounds of 
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BER, which are always loose. We can expect the "optimal" nonuniform group 
codes suggested in this paper are indeed not optimal. Future works may include 

1. Calculation of exact theoretical BER 

2. Designing the nonuniform group codes in a "more" systematical way 

3. Compare the performance of our nonuniform STBC to the theoretical 
capacity bound of the broadcast system 
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